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Abstract 

This paper investigates the common intuition suggesting that dur- 
ing crises the shape of the financial market clearly differentiates from 
that of random walk processes. In this sense, it challenges the analysis 
of the nature of financial markets proposed by Fama and his associates. 
For this, a geometric approach is proposed in order to define the pat- 
terns of change of the market and a measure of multivariate kurtosis is 
used in order to test deviations from multinormality. The emergence 
of crises can be measured in this framework, using all the available 
information about the returns of the stocks under consideration and 
not only the index representing the market. 

1 Introduction 

The Samuelson theory of asset prices (Samuelson, 1965) was the basis of 
the Efficient Market Hypothesis (Fama, 1965, 1970, 1991), assuming that 
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the stock returns are approximated by a random walk and their changes 
are unpredictable. Nevertheless, many scholars criticized the theoretical as- 
sumptions of the EMH, namely its informational requisites (e.g. Grossman 
and Stiglitz, 1980), detected the existence of time dependencies violating 
the conditions of the theory (e.g. Lo and MacKinley, 1988), and identified 
the existence of asymmetry and fat tails in the distribution of returns (e.g. 
Plerou et al, 2001). Consequently, due to these and other discrepancies, the 
EMH "is respected but not worshipped" in financial theory and statistics 
(Pesaran, 2010: 29). 

Tools for analysis of deviations from normality existed for long, e.g. the 
concept of Kurtosis was introduced by Karl Pearson (1905) in order to mea- 
sure the size of the tails of a distribution as compared with those of the 
normal. It was assumed, by Pearson and then by "Student" (1927), who 
suggested a curious mnemonic based on the shape of animals in order to 
describe non-normal distributions, that kurtosis, as well as skewness, are 
common features of nature. 

Fat tails are interpreted as the result of a larger part of the variance being 
provoked by rare extreme events, as compared to the normal distribution. In 
this sense, Pesaran discusses the effect of the recent punctuation of crashes 
- such as the dot-com crash of 2000 and the general financial crash after 
the subprime crisis of 2007 - and argues that periods of bubbles and crashes 
deviate from market efficiency (Pesaran, 2010). This is precisely the intuition 
we pursue in this paper, proposing a new approach to the measurement of 
the dynamics of changes of the distributions representing the stock returns. 
In the following, we describe the emergence of crises amidst long periods of 
normal trading and, as we are interested in major changes that occur at the 
fat tails of the distribution, we use extensions of the concept of kurtosis to 
the realm of a n— dimensional object. 

For this, we proceed as follows: first, we define the concepts used to de- 
scribe the dynamics of the market; the following section presents the geom- 
etry applied to the measurement of the distortion in certain periods; finally 
the next sections describe the results of the statistics applied to relevant 
periods, when the market differentiates itself from a random walk type of 
behavior. 
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2 The identification of the dynamics of change 
of the market 



In the province of statistical research on financial data, the evidence of 
fat tails is expressed in typical leptokurtic distributions. Recently, Pesaran 
proposed an empirical verification of four indexes (S&P, FTSE 100, DAX, 
NIKKEI 225), for 2000-2009, and found evidence of kurtosis, rejecting the 
null hypothesis of a normal distribution using a Jarque-Bera test (Jarque, 
Bcra, 1987). In the same paper, originated as a contribution to a seminar in 
honor of Fama, the author mentions the historical data of the monthly re- 
turns measured by the S&P, as compiled by Shiller (2011): from 1871 to 2009 
there is an impressive evidence of kurtosis, and even when shorter periods 
are chosen, deviations from normality are typical. 

As a consequence of this common empirical evidence of non-normality 
and asymmetric and heavy tailed distributions of financial data, anticipated 
by no less than Fama himself (Fama, 1965, also Dufour et al, 2003), several 
adjusted models for skewness and kurtosis have been proposed in financial 
statistics, namely in the asset pricing applications (e.g., Jarrow and Rudd, 
1982, Corrado and Tsu, 1996, Brown and Robinson, 2002, Vahamaa, 2003, 
Iqbal et al, 2010). In this sense, for instance Dufour, Beaulieu and Khalaf 
proposed the incorporation of asymmetry of the return distribution on asset 
evaluation (Dufour et al, 2003, Beauheu et al, 2005, 2009). 

The problem we address in this paper follows these lines of research but, 
instead of describing the market with the recourse to a single measurement 
of an index, we propose to capture the whole available empirical information 
on the dynamics of a population of stocks through time, considering as a 
consequence the multivariate process. There are sound reasons for this option 
since, in general, the statistical experiments and empirical approaches "are 
multivariate by nature" (Liu et al, 1999: 783). 

By using a stochastic geometry technique, we found that the dynamics of 
the S&P500 set of stocks defines market spaces as low-dimensional entities 
and that this low-dimensionality is caused by the small proportion of sys- 
tematic information present in correlations among stocks in normal periods 
of trade. However, this situation changes dramatically in periods of crashes 
or crises. 

This is verified with extensive data representing different sets of the daily 
returns, namely that of the 236 S&P500 stocks that remained in the market 
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Figure 1: The evolution of the S&P500 for 1973-2009. 

from 1973 to 2009, and then verified by that of the 471 S&P500 stocks sur- 
viving from 2005 to 2009 (see Fig 1). These populations are used to study 
the market dynamics: in both cases, in the subperiods of business- as-usual, 
the geometric object defined by the dynamics of the market approaches the 
spherical configuration, typical of a Gaussian distribution; conversely, when 
a subperiod includes relevant crashes, the shape of that geometric object is 
distorted, acquiring prominences in some particular directions. Moreover, 
we found that, during crashes, market spaces contract along their effective 
dimensions. In this, we also follow a definition by R.A. Fisher (1953), estab- 
lishing that, measured in an Euclidean space, the multivariate normal errors 
are described by the surface of a sphere, and suggesting that, whenever large 
errors occur, the topological deviations have to be considered. 

In order to capture the contracting and distortion effects in the mar- 
ket shape, we measure multivariate kurtosis {b2,p) as presented in the next 
section. 

3 The measure of the distortion of the space 
of the market 

The strategy of measurement of the space of the financial market is simply 
stated in the following terms. From the set of returns of the stocks and their 
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historical data of returns over the time interval, and using an appropriate 
metric (Mantegna, 1999, 2000), we compute the matrix of distances between 
the stocks. Considering the returns for each stock, 

r{k) = log{pt{k)) - log{pt-i{k)) (1) 
a normalized vector 

is defined, where n is the number of components (number of time labels) 
in the vector With this vector the distance between the stocks k and / is 
defined by the Euchdean distance of the normahzed vectors. 



dM = V2 (1 - Cij) = \\-tik) - ^(011 (3) 
with Cki being the correlation coefficient of the returns r{k),r{l). 

r (^(fe)7^(0) - {1^{k)) (7^(0) 

As the distance is properly defined according to the due metric axioms, 
it is possible to obtain, from the matrix of distances, the coordinates for 
the stocks in a Euclidean space of dimension smaller than N. The standard 
analysis of reduction of the coordinates is applied to the center of mass and 
the eigenvectors of the inertial tensor are then computed. 

The same technique is also applied to surrogate (time-permuted and ran- 
dom) data, namely to data obtained by independent time permutation for 
each stock, and these eigenvalues are compared with those obtained from 
actual data in order to identify the characteristic directions for which the 
eigenvalues are significantly different. They define a reduced subspace of di- 
mension /, which carries the systematic information related to the market 
correlation structure. 

This corresponds to the identification of empirically constructed variables 
that drive the market and, in this framework, the number of surviving eigen- 
values is the effective characteristic dimension of this economic space (/). 
This procedure is the key for the following method, since it allows for the 
consideration of populations of hundreds of stocks, given that only a very 
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Figure 2: The market spaces along their main 3 dimensions: each point 
describes a firm, for surrogate data and business-as-usual periods (the upper 
plots) correspond to a spherical shape while in periods of crises (the plots in 
the bottom) proeminences in the market shape are obvious. 



small number of coordinates describing their distances is used in the compu- 
tation of our measures of the multivariate space. 

This suggests the definition of a systematic covariance. For this, we 
denote by ~^{k)^f^ the restriction of the A;— asset to the subspace Vf. and 
by d)^{ the distances restricted to this space. Then using Eqs.(3) and (4) we 
may define a notion of systematic covariance a^i 

where ^ik = |^(A;)(/)|/|l^(fc)| , = (^{k)) and a^k = {l^{k)l^{k)) . 

As Figure 2 clearly indicates, in periods of normal trade the spherical 
configuration is maintained, similar to that of surrogate data, whereas in 
periods of turbulence new shapes emerge. 

The plots of Figure 2 show that while in the first days of January 2007 
there is no relevant difference in relation to surrogate (time-permuted) data. 
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new shapes emerge in March 2007 as weU as in November 2008, two periods 
of strong turbulence in the financial markets, indicated by a distortion in the 
shape of the sphere. 



4 A Test Based on Multivariate Kurtosis 

Given this evidence of contraction and distortion of the market space in dif- 
ferent periods, we proceed to test deviation from normality. The computation 
of univariate measures and tests of deviations of general indexes from nor- 
mality is a common procedure (e.g., Pesaran, 2010). Instead, we propose to 
consider the available information on the richest detail of a large population 
of stocks. For this, we recur to heuristic concepts and a test of multivariate 
kurtosis (62, p), such as proposed by Mardia (1970). In this case, multivariate 
kurtosis is defined as 

h,p{t) = j^E,[{z,{t)-z){a(f^)-\z,{t)-z)Y (6) 

where a^^^ is the systematic covariance, p is the number of variables and 
N the number of observations. In the calculation of o"*^-^-* / = 6 since six 
was found (Araujo and Louca, 2007, 2008) to be the number of effective 
dimensions of the S&P500 market space. 

Although the statistical properties of tests of multivariate normality are 
not as established as those applied to univariate normality (Mardia et al, 
1979, Gnanadesikan, 1997), a large body of literature was built in the last 
decades on the topic. Mardia (1970, 1974, 1983), using Arnold's results 
(Arnold, 1964) proposed the affine invariant measures as previously discrim- 
inated, estabhshed their asymptotic distributions and formulated tests for 
the null of multivariate normality. Different scholars discussed the limit dis- 
tributions of Mardia's tests (Schwagcr, 1985, Baringhaus and Hcnzc, 1992, 
Kariya and George, 1995, Zhao and Konichi, 1997) and investigated their 
consistency (Baringhaus and Henze, 1988, Henze, 1994). Others, as Koziol 
(1982, 1983, 1993, 2005), Srivastava (1984) and Henze (1990), proposed al- 
ternative approaches to test multivariate normality. 

Several authors surveyed the available procedures and tests of multivari- 
ate normality and identified more than fifty alternatives, although only some 
qualified as generally accepted methods (Horswell and Looney, 1992, Looney, 
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1995, Mecklin and Mundfrom, 2004, 2005, Szekely and Rizzo, 2005, Farrell 
et al, 2006). Through the comparison of the power of the different tests, 
using extensive Monte Carlo simulations, some of these authors argued that 
the Mardia tests have low power (Farrell et al, 2006) in particular against 
the BHEP test proposed by Henze and his collaborators (Meckhn and Mund- 
from, 2004, 2005), whereas others obtained an opposite conclusion, favoring 
the Mardia test (Dufour et al, 2003). Bai and Ng, considering the fact that 
the sampling distributions of these coefficients is not well known for serially 
correlated data, proposed a strategy of generalization of the Jarque-Bera 
test in order to account for these problems (Bai, Ng, 2005, also Doornik and 
Hansen, 2008). 

Considering the stationarity of our series of returns, by construction, we 
are nevertheless confronted with evidence of serial and cross correlation. It 
is well known that daily returns tend to be negatively serially correlated, 
that their statistical significance tends to be greater in periods of unrest 
and that their cross correlation increases with volatihty (Pesaran, 2010). A 
novel approach to deal with this problem, considering serial correlation in 
the residuals of overlapping observations, was proposed recently by Pesaran 
and his colleagues, using a new version of a seemingly unrelated regression 
equations based estimation (Pesaran et al, 2011). Furthermore, there is evi- 
dence of cross correlation, which has been rarely discussed in the framework 
of multivariate analysis, with some exceptions (e.g., Richardson and Smith, 



Considering these suggestions, we checked our data applying the Mardia 
measure to the stocks in our population, for each period, in order to diagnose 
deviations from normality and to describe the dynamics of the market in 
different periods. In each case we proceeded to systematic comparisons with 
the measures of series of random data obtained from a Gaussian distribution 
with the same average and variance as in our population. 

Mardia's test of multivariate normality is performed in order to determine 
if the null hypothesis of multivariate normality is a reasonable assumption 
regarding the population distribution of a random sample. 

Mardia proved that, under the null hypothesis, the statistics 



is asymptotically distributed as a standard normal. Since our aim is not 
only to test the data but essentially to distinguish the periods of business- 



1993). 



&2,p(t)-(p^+2p) 



(7) 
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as-usual from the periods of crises and, in that sense, to test the distortions 
in the dynamics of the markets, we replace the expected vahie and standard 
deviation used in Mardia's standardization with the empiric counterparts 
as obtained from the observed values of the statistics in a business- as-usual 
period. In this sense, we consider business- as-usual periods 1973-1995 for 
the longer series and January 2005 to June 2007 for the shorter series. The 
modified statistics is then 



9it) = ''%t::ir 

where b2,p(t) and a{b2,p(t)) axe the estimated values of, respectively, the 
mean and the standard deviation. In this case, our variables are the six 
coordinates identifying the relevant dimensions, which represent the relevant 
information about the market. This is a robust result, unaltered even when 
other dimensions are considered, confirming our previous result indicating 
that such dimensions essentially represent noise. Even if the Mardia test - 
or other tests on multivariate normality - cannot be applied to a very large 
number of variables, since the asymptotic properties are not known for those 
cases, this strategy allows both for considering a large population (236 and 
471 firms) and to test the described dynamics of the market, considering all 
the available information and not just a single index averaging through the 
market. 

5 Results and Discussion 

In previous empirical work, we found a robust result: markets of different 
sizes, ranging from 70 to 424 stocks across different time windows (from one 
year to 35 years), and using different market indexes for different markets, 
may be described by six effective dimensions (Araiijo and Louga, 2007, 2008). 
A striking characteristic of the data is also that, whenever the market suffers 
a crash, there is a contracting effect provoking a clear distortion in the dom- 
inant directions of the market space. If the volume expands whenever the 
cloud of points represents a situation of business- as-usual and the market 
space is similar to that of a random universe, whenever a crisis occurs the 
volume of the geometric object severely contracts, leading to the emergence 
of characteristic and distorted shapes. 
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Figure 3: The evolution of g(t) from 2005 to 2009. 



Figure 3 describes the evolution of g(t) for 2005-2009 and Figure 4 the 
same statistic for the larger period of 1973-2009, and the typical limits of 
the interval corresponding to a level of significance of 5% are indicated. The 
hypothesis of normality is clearly rejected in the periods of turbulence, as 
expected. 

In Figure 5 the results of the test for random data with the same standard 
deviation and expected value as the true data are presented, highlighting by 
comparison the presence of structure in the market described by our data. 



6 Conclusion 

The multivariate analysis has rarely been applied to stock market series. In 
this paper, we propose a novel approach to that application, which permits 
the consideration of the evolution of a large population of stocks for different 
periods and describes the deformations in the geometry of the market, while 
discussing some of the technical difficulties involved in this enterprise. This 
is possible since the evolution of the distances among all the firms is encap- 
sulated by a small manifold of coordinates in the market space. Although 
this presents some statistical difficulties we are addressing in the current re- 
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Figure 4: The evolution of g(t) from 1973 to 2009. 

search, this avenue favors a new approach to the multidimensional objects 
constructed by the dynamics of complex markets and interactions among 
many firms and decisions. 

As it was previously found, during periods of financial turbulence the 
shape of the market changes dramatically, whereas in periods of normal 
business it resembles the spherical form typical of a random distribution. 
Our data confirms those results. A moment, kurtosis, is used in order to 
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Figure 5: The evolution of g(t) corresponding to random data with the same 
mean and stardard deviation as in Figure 3 (l.h.s.) and Figure 4 (r.h.s.) 
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measure the distortions of the distribution, and the results highlight how 
the distances among stocks contract in periods of unrest. Given this, the 
hypothesis of well behaved random distribution of the stock returns can be 
challenged and should be challenged. It does not correspond to the state of 
nature during a crisis or an episode of turbulence in the financial markets we 
investigated. 
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